Novelty Detection Model Selection Using Volume Estimation
نویسنده
چکیده
In this paper, we present an approach to selecting models for novelty (outlier) detection. Our approach minimizes the risk of accepting outliers at a fixed normal rejection rate, under the assumption that the distribution of abnormal (outlier) data is uniformly distributed in some bounded region of the input space. This risk is minimized by selecting the model with the smallest volume acceptance region, using a randomized volume estimation algorithm. The volume estimation algorithm can estimate the volume of a body in high-dimensional space and scales polynomially in dimension with the number of calls to the model. We have performed extensive experiments which show that the combined model selection criteria are able to select not only the best models from a given model class, but also among all model classes. Novelty Detection Model Selection Using Volume Estimation Edward Meeds Department of Computer Science, University of Toronto
منابع مشابه
Geometrical and computational aspects of Spectral Support Estimation for novelty detection
In this paper we discuss the Spectral Support Estimation algorithm [1] by analyzing its geometrical and computational properties. The estimator is non-parametric and the model selection depends on three parameters whose role is clarified by simulations on a two-dimensional space. The performance of the algorithm for novelty detection is tested and compared with its main competitors on a collect...
متن کاملThe Minimum Volume Covering Ellipsoid Estimation in Kernel-Defined Feature Spaces
Minimum volume covering ellipsoid estimation is important in areas such as systems identification, control, video tracking, sensor management, and novelty detection. It is well known that finding the minimum volume covering ellipsoid (MVCE) reduces to a convex optimisation problem. We propose a regularised version of the MVCE problem, and derive its dual formulation. This makes it possible to a...
متن کاملAdaptive Mixture Discriminant Analysis for Supervised Learning with Unobserved Classes
In supervised learning, an important issue usually not taken into account by classical methods is the possibility of having in the test set individuals belonging to a class which has not been observed during the learning phase. Classical supervised algorithms will automatically label such observations as belonging to one of the known classes in the training set and will not be able to detect ne...
متن کاملLDA Topic Model with Soft Assignment of Descriptors to Words
The LDA topic model is being used to model corpora of documents that can be represented by bags of words. Here we extend the LDA model to deal with documents that are represented by bags of continuous descriptors. Given a finite dictionary of words, our extended LDA model allows for the soft assignment of descriptors to (many) dictionary words. We derive variational inference and parameter esti...
متن کاملThe Effect Of Smoothing In Language Models For Novelty Detection
The novelty task consists of finding relevant and novel sentences in a ranking of documents given a query. In the literature, different techniques have been applied to address this problem. Nevertheless, little is known about Language Models for novelty detection and, especially, the effect of smoothing on the selection of novel sentences. Language Models can be used to study novelty and releva...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005